Chapter 1 

Self-energy-functional theory 

Michael Potthoff 



Abstract S 

elf-energy-functional theory is a formal framework which allows to de- 
rive non-perturbative and thermodynamically consistent approximations for 
lattice models of strongly correlated electrons from a general dynamical vari- 
ational principle. The construction of the sclf-cncrgy functional and the cor- 
responding variational principle is developed within the path-integral formal- 
ism. Different cluster mean-field approximations, like the variational cluster 
approximation and cluster extensions of dynamical mean-field theory are de- 
rived in this context and their mutual relationship and internal consistency 
are discussed. 



1.1 Motivation 

The method of Green's functions and diagrammatic perturbation theory [1] 
represents a powerful approach to study systems of interacting electrons 
in the thermodynamical limit. Several interesting phenomena, like sponta- 
neous magnetic order, correlation-driven metal-insulator transitions or high- 
temperature superconductivity, however, emerge in systems where electron 
correlations are strong. Rather than starting from the non-interacting Fermi 
gas as the reference point around which the perturbative expansion is devel- 
oped, a local perspective appears to be more attractive for strongly correlated 
electron systems, in particular for prototypical lattice models with local in- 
teraction, such as the famous Hubbard model [2H1]: 
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H = Y^ Uj4a(^ja + ^Y1 • (1-1) 

The local part of the problem, i.e. the Hubbard atom, can be solved easily 
since its Hilbert space is small. It is therefore tempting to start from the 
atomic limit and to treat the rest of the problem, the "embedding" of the 
atom into the lattice, in some approximate way. The main idea of the so- 
called Hubbard-I approximation [5] is to calculate the one-electron Green's 
function from the Dyson equation where the self-energy is approximated by 
the self-energy of the atomic system. This is one of the most simple embedding 
procedures. It already shows that the language of diagrammatic perturbation 
theory. Green's functions and diagrammatic objects, such as the self-energy, 
can be very helpful to construct an embedding scheme. 

The Hubbard-I approach turns out to be a too crude approximation to 
describe the above-mentioned collective phenomena. One of its advantages, 
however, is that it offers a perspective for systematic improvement: Nothing 
prevents us to start with a more complicated "atom" and employ the same 
trick: Wc consider a partition of the underlying lattice with L sites (where 
L — oo) into L/Lc disconnected clusters consisting of Lc sites each. If 
is not too large, the self-energy of a single Hubbard cluster is accessible by 
standard numerical means [S] and can be used as an approximation in the 
Dyson equation to get the Green's function of the full model. This leads to 
the cluster perturbation theory (CPT) [Sl[7]. 

CPT can also be motivated by treating the Hubbard interacton U and 
the mter-cluster hopping F as a perturbation of the system of disconnected 
clusters with mira-cluster hopping t' . The CPT Green's function is then 
obtained by summing the diagrams in perturbation theory to all orders in U 
and V but neglecting vertex corrections which intermix U and V interactions. 

While these two ways of deriving CPT are equivalent, one aspect of the 
former is interesting: Taking the self-energy from some reference model (the 
cluster) is reminiscent of dynamical mean-field theory (DMFT) [SHTO] where 
the self-energy of an impurity model approximates the self-energy of the 
lattice model. This provokes the question whether both, the CPT and the 
DMFT, can be understood in single unifying theoretical framework. 

This question is one motivation for the topic of this chapter on self-energy- 
functional theory (SET) [TTHT4] . Another one is that there are certain defi- 
ciencies of the CPT. While CPT can be seen as a cluster mean-field ap- 
proach since correlations beyond the cluster extensions are neglected, it not 
self-consistent, i.e. there is no feedback of the resulting Green's function on 
the cluster to be embedded (some ad hoc element of self-consistency is in- 
cluded in the original Hubbard-I approximation). In particular, there is no 
concept of a Weiss mean field and, therefore, CPT cannot describe different 
phases of a thermodynamical system nor phase transitions. Another related 
point is that CPT provides the Green's function only but no thermodynam- 
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ical potential. Different ways to derive e.g. the free energy from the Green's 
function pQITSmB] give inconsistent results. 

To overcome these deficiencies, a self-consistent cluster-embedding scheme 
has to be set up. Ideally, this results from a variational principle for a gen- 
eral thermodynamical potential which is formulated in terms of dynamical 
quantities as e.g. the self-energy or the Green's function. The variational 
formulation should ensure the internal consistency of corresponding approx- 
imations and should make contact with the DMFT. This sets the goals of 
self-energy-functional theory and also the plan of this chapter. 



1.2 Self-energy functional 

1.2.1 Hamiltonian, grand potential and self-energy 

We consider a system of electrons in thermodynamical equilibrium at tem- 
perature T and chemical potential /i. The Hamiltonian of the system H = 
H{t, U) = Hoit) + Hi{U) consists of a non- interacting part specified by one- 
particle parameters t and an interaction part with interaction parameters 
U: 



The index a refers to an arbitrary set of quantum numbers labelling an 
orthonormal basis of one-particle states \a). As is apparent from the form of 
H, the total particle number N is conserved. 

The grand potential of the system is given by fit.u = In Zt,u where 



is the partition function. The dependence of the partition function (and of 
other quantities discussed below) on the parameters t and U is made explicit 
through the subscripts. 

The one-particle Green's function Gapii^) = {{cal cjj)) of the system is the 
main object of interest. It will provide the static expectation value of the 
one-particle density matrix cj^c^ and the spectrum of one-particle excitations 
related to a photoemission experiment |17j . The Green's function can be 
defined for complex uj via its spectral representation: 




(1.2) 



Zt,u = trpt,!/ with pt.u = cxp{-{H{t, U) - iiN)/T) 



(1.3) 




(1.4) 
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where the spectral density Aa/3{z) = cxp{izt)Aai3{t) is the Fourier 

tranform of 

^Mt) = ^{K{t),cliO)]+) , (1.5) 

which involves the anticommutator of an annihilator and a creator with a 
Heisenberg time dependence 0{t) = exp{i{H — iJ,N)t)0 exp{—i{H — fiN)t). 

Due to the thermal average, (O) = ti{pt^uO)/Zt^u, the Green's function 
depends on t and U and is denoted by Gt,u,ai3{'^)- For the diagram technique 
employed below, we need the Green's function on the imaginary Matsubara 
frequencies uj = icj„ = i{2n + l)7rT with integer n [1]. In the following the 
elements Gt^u.api^'-^) are considered to form a matrix Gt^u which is diagonal 
with respect to n. 

The "free" Green's function Gt,o is obtained for U — 0, and its elements 
arc given by: 

1 



Gt,o,ap{i^^n) = -. ; 7 • (1-6) 



1^1 — t 



a/3 



Therewith, we can define the self-energy via Dyson's equation 

Gt,u = , ^ , (1.7) 
Gtfi - ^t.U 

i.e. St.u = G^Q — G^Ij. The full meaning of this definition becomes clear 
within the context of diagrammatic perturbation theory [T]. 

Here, we like to list some important properties of the self-energy only: 
(i) Via Dyson's equation, it determines the Green's function, (ii) The self- 
energy has a spectral representation similar to Eq. (jl.4[) . (iii) In particular, 
the coresponding spectral function (matrix) is positive definite, and the poles 
of St,u arc on the real axis [18] . (iv) (uj) — ii a or (3 refer to one-particle 
orbitals that are non-interacting, i.e. if a or /3 do not occur as an entry of the 
matrix of interaction parameters U. Those orbitals or sites arc called non- 
interacting. This property of the self-energy is clear from its diagrammatic 
representation, (v) If a refers to the sites of a Hubbard-type model with 
local interaction, the self-energy can generally be assumed to be more local 
than the Green's function. This is corroborated e.g. by explicit calculations 
using weak-coupling perturbation theory [T5H^ and by the fact that the 
self-energy is purely local on infinite-dimensional lattices j51l22). 



1.2.2 Luttinger-Ward functional 

We would like to distuinguish between dynamic quantities, like the self- 
energy, which is frequency-dependent and related to the (one-particle) excita- 
tion spectrum, on the one hand, and static quantities, like the grand potential 
and its derivatives with respect to ^, T, etc. which are related to the ther- 
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modynamics, on the other. A Hnk between static and dynamic quantities is 
needed to set up a variational principle which gives the (dynamic) self-energy 
by requiring a (static) thermodynamical potential be stationary, There are 
several such relations [T1 [T51[T5] . The Luttinger- Ward (LW) functional (Pu [G] 
provides a special relation with several advantageous properties |23j : 

(i) <l'u[G\ is a functional. Functionals A ~ A[- ■ ■] are indicated by a hat 
and should be distinguished clearly from physical quantities A. 

(ii) The domain of the LW functional is given by "the space of Green's 
functions" . This has to be made more precise later. 

(iii) If evaluated at the exact (physical) Green's function, Gt,u, of the 
system with Hamiltonian H = H{t, U), the LW functional gives a quan- 
tity 

$u[Gt,u] = 'Pt,u (1-8) 
which is related to the grand potential of the system via: 

^t^u - ^t,u + Tr In Gt,[/ - TiiS^uGuu) ■ (1-9) 

Here the notation TrA = T^^^ e'^'^"^^ Aaa{i<^n) is used. 0+ is a pos- 
itive infinitesimal. 



(iv) The functional derivative of the LW functional with respect to its 
argument is: 

= (1.10) 

Clearly, the result of this operation is a functional of the Green's function 
again. This functional is denoted by S since its evaluation at the physical 
(exact) Green's function Gt,u yields the physical self-energy: 

^[Gt,u] = St.u ■ (1.11) 

(v) The LW functional is "universal": The functional relation 'Pu[ • •] is 
completely determined by the interaction parameters U (and docs not 
depend on t). This is made explicit by the subscript. Two systems (at 
the same chemical potential fi and temperature T) with the same in- 
teraction U but different one-particle parameters t (on-site energies and 
hopping integrals) are described by the same Luttinger- Ward functional. 
Using Eq. (|1.10p . this implies that the functional IJu[G] is universal, too. 



(vi) Finally, the LW functional vanishes in the non-interacting limit: 



$u[G] = for U = 



(1.12) 
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Fig. 1.1 Original definition of the Luttinger-Ward functional <l>u[G], see Ref. | 23| . Double 
lines: fully interacting propagator G. Dashed lines: interaction U. 



1.2.3 Diagrammatic derivation 



can be constructed order by order in diagrammatic perturbation theory. The 
functional is obtained as the limit of the infinite series of closed renormal- 
ized skeleton diagrams, i.e. closed diagrams without self-energy insertions 
and with all free propagators replaced by fully interacting ones (see Fig. 
II. ip . There is no known case where this skeleton-diagram expansion could be 
summed up to get a closed form for (PulG]. Therefore, the exphcit functional 
dependence is unknown even for the most simple types of interactions like 
the Hubbard interaction. 

Using the classical diagrammatic definition of the LW functional, the prop- 
erties (i) - (vi) listed in the previous section are easily verified: By construc- 
tion, ^u[G] is a functional of G which is unisersal (properties (i), (ii), (v)). 
Any diagram is the series depends on U and on G only. Particularly, it is 
independent of t. Since there is no zeroth-order diagram, <l>u\G\ trivially 
vanishes for 17 = 0, this proves (vi). 

Diagrammatically, the functional derivative of (Pu[G] with respect to G 
corresponds to the removal of a propagator from each of the <P diagrams. 
Taking care of topological factors [HHS], one ends up with the skeleton- 
diagram expansion of the self-energy (iv). Therefore, Eq. is obtained 
in the limit of this expansion. 

Eq. ()1.9|) can be derived by a coupling-constant integration [23]. Alterna- 
tively, it can be verified by integrating over ^: We note that the /x derivative 
of the l.h.s and of the r.h.s of Eq. ()1.9|) are equal for any fixed interaction 
strength t, U and T. Namely, {d/dn){<Pt u + TrlnGt u - TriJ* uGt u) = 
TTG;l{dGt,u/dtx) - TTGt,u{dSt,u/d'ti) = -Tr Gt^u = -{Nhu = 
df2t,u /dfi. Here, we have used Eq. (|1.3|) in the last step and Eq. (|1.7|) . Eq. 
(|1.8p and Eq. (jl.lO|) before. {N)t,u is the grand-canonical average of the total 
particle- number operator. Integration over /z then yields Eq. ()1.9|) . Note that 
the equation holds trivially for fj, -co, i.e. for {N)t,u since St,u = 
and 'Pt,u = in this limit. 



In the original paper by Luttinger and Ward 
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1.2.4 Derivation using the path integral 

For the diagrammatic derivation it has to be assumed that the skeleton- 
diagram series is convergent. It is therefore interesting to see how the LW 
functional can be defined and how its properties can be verified within a path- 
integral formulation. This is non-perturbative. The path-integral construction 
of the LW functional was first given in Rcf . [M] . 

Using Grassmann variables (TB] £,a{i^n) = ^^/^ c?r e*""'^^Q(r) and 
if* (ia;,i) = T^/^ /o^^^ '^'''^"'""^^^(t), the elements of the Green's function are 
given by Gt,u ,ap{iuJn) = ~{5,a{ii^n)^*3{i^n))t,u ■ The average 

Gt.i7,Q/3(«WTi) = /^C-Dr^a(*w„)^«(ii:j„)exp (At, (1-13) 

Zt.u J 

is defined with the help of the action At,u,^^* = ^t,CC* + -^u,^^* where 

^t,t/,??* = ^ Caij''^n){{ii^n + M)'5a/3 " tafi)£.p{ii^n) + ^I7,{{* (1-14) 

and 

Au.ii' =-\Y. ^"^^7 dr Cc.{T)Cp{r)i^{T)^8{r) . (1.15) 

This is the standard path-integral representation of the Green's function |16| . 

The action can be considered as the physical action that is obtained when 
evaluating the functional 

71, a/3 

at the (matrix inverse of the) physical free Green's function, i.e. 

Using this, we define the functional 

r?t/[G(7'] = -T\nZu[G^^] (1.18) 

with 

Zu[G^']^ j DiDCc^^(AuMG^']) ■ (1.19) 

The functional dependence of i7[7[G(^^] is determined by U only, i.e. the 
functional is universal. Obviously, the physical grand potential is obtained 
when inserting the physical inverse free Green's function Gi^: 
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f2u[Gll] - Ot^u ■ (1.20) 
The functional derivative of Eq. (|1.18p leads to another universal functional: 

with the property 

ac/[G,J] = Gt,f/. (1.22) 

This is easily seen from Eq. (|1.13p . 

The strategy to be pursued is the following: Gu[Gq^] is a universal {t 
independent) functional and can be used to construct a universal relation 
S = Su[G] between the self-energy and the one-particle Green's function 
- independent from the Dyson equation (jl.7p . Using this and the universal 
functional HuIGq^], a universal functional ^u[G] can be constructed, the 
derivative of which essentially yields IJu[G] and that also obeys all other 
properties of the diagrammatically constucted LW functional. 

To start with, consider the equation 

gu[G-^ + S] = G. (1.23) 

This is a relation between the variables S and G which for a given G, may 
be solved for S. This defines a functional Z!u[G], i.e. 

QuiG-' - Su[G]] = G . (1.24) 

For a given Green's function G, the self-energy S = Sij[G] is defined to be 
the solution of Eq. (|1.23p . From the Dyson equation (fTT7|) and Eq. (|1.22p it 
is obvious that the relation (|1.23l) is satisfied for the physical S = St,u and 
the physical G = Gt^u of the system with Hamiltonian Ht,u'- 

i:u[Gt,u] = ^t.u . (1.25) 

This construction simplifies the original presentation in Ref. j24j . The discus- 
sion on the existence and the uniqueness of possible solutions of the relation 
(|1.23p given there applies accordingly to the present case. 

With the help of the functionals Qij[Gq^] and Su[G\, the Luttinger-Ward 
functional is obtained as: 

$u[G] = Qu[G-^ + Su[G]] -TvlnG + i:v{Eu[G]G) . (1.26) 



Let us check property (iv). Using Eq. ()1.2ip one finds for the derivative of 
the first term: 
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7f --yyu.iSiG +2^[/ — \ — 

T dGaP ^ \0Gafi oGctp J 

(1.27) 

and, using Eq. (|1.24p . 

1 5Qu[G-^ + EulG]] _ ^ (5G~l 5Su^^\\ 

T ig::^ --2^G..(^^ + ^^j . (1.28) 

Therewith, 

r^G^T^ ^""^ 5Gap ^TSG7p [-^^'-G + TriEu[G]G)) 

(1.29) 

SuMi^n)[G] , (1.30) 



and, finally, 

1 S$u[G] 



T SGapiiuJn 

where, as a reminder, the frequency dependence has been reintroduced. 

The other properties are also verified easily, (i) and (ii) are obvious, (iii) 
follows from Eq. p.20|) . Eq. ()1.22p and Eq. (|1.25p and the Dyson equation 
(|1.7p . The universality of the LW functional (v) is ensured by the presented 
construction. It involves universal functionals only. Finally, (vi) follows from 
Gu=o[G~'^] = G which implies (via Eq. pT^ ) £[/=o[G] = 0, and with 
Qu=o[G-^] = TrlnG we get $u=o[G] = 0. 



1.2.5 Variational principle 

The functional Su [G] can be assumed to be invertible locally provided that 
the system is not at a critical point for a phase transition (see also Rcf. [llj). 
This allows to construct the Legendre transform of the LW functional: 

Puis] = $u[Gu[S]] - Tr{SGu[S]) . (1.31) 

Here, Gu[Su[G]] = G. With Eq. (fOOl) we immediately find 

1 SFu\S] 



T SS 

We now define the self-energy functional: 



-Gum ■ (1-32) 



ntM^] = Trln / + Fu[S] . (1.33) 
^t.o ^ ^ 
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Its functional derivative is easily calculated: 



The equation 



1^%^^^^ Gum. (1.34) 



Gu[i:] = (1.35) 



is a (highly non-linear) conditional equation for the self-energy of the system 
H = Hoit) + Hi{U). Eqs. (fTTl) and (fL25l) show that it is satisfied by the 
physical self-energy S = St,u- Note that the l.h.s of (|1.35p is independent of 
t but depends on U (universality of Gu[^])^ while the r.h.s is independent 
of U but depends on t via G^q . The obvious problem of finding a solution of 

Eq. (jl.35|) is that there is no closed form for the functional Gu[^]- Solving 
Eq. ()1.35p is equivalent to a search for the stationary point of the grand 
potential functional of the self-energy: 

= . (1.36) 
This is the starting point for self-energy-functional theory. 



1.2.6 Approximation schemes 

Up to this point we have discussed exact relations only. It is clear, however, 
that it is generally impossible to evaluate the self-energy functional Eq. (|1.33p 
for a given S and that one has to resort to approximations. Three different 
types of approximation strategies may be distinguished: 

In a type-I approximation one derives the Euler equation 6f2t,u[^]/S^ = 
first and then chooses (a physically motivated) simplification of the equation 
afterwards to render the determination of St,u possible. This is most general 
but also questionable a priori, as normally the approximated Euler equation 
no longer derives from some approximate functional. This may result in ther- 
modynamical inconsistencies. 

A type-II approximation modifies the form of the functional dependence, 
^t,u[' • •] ^ ^t^ui' ' ■]' S^^ ^ simpler one that allows for a solution of the 
resulting Euler equation S n'j.^^lZ!] / S S ~ 0. This type is more particular and 
yields a thcrmodynamical potential consistent with St.u- Generally, however, 
it is not easy to find a sensible approximation of a functional form. 

Finally, in a type-Ill approximation one restricts the domain of the func- 
tional which must then be defined precisely. This type is most specific and, 
from a conceptual point of view, should be preferred as compared to type-I or 



1 Sclf-onergy-functional theory 



11 



type-II approximations as the exact functional form is retained. In addition to 
conceptual clarity and thermodynamical consistency, type-III approximations 
are truely systematic since improvements can be obtained by an according 
extension of the domain. 

Examples for the different cases can be found e.g. in Ref. [14]. The classifi- 
cation of approximation schemes is hierarchical: Any type-III approximation 
can always be understood as a type-II one, and any type-II approximations 
as type-I, but not vice versa. This does not mean, however, that type-III ap- 
proximations are superior as compared to type-II and type-I ones. They are 
conceptually more appealing but do not necessarily provide "better" results. 
One reason to consider self-energy functional instead of functional of the 
Green's function (see Refs. [1S1[55], for example), is to derive the DMFT as 
a type-III approximation. 



1.3 Variational cluster approach 

The central idea of self-energy-functional theory is to make use of the uni- 
versality of (the Legendre transform of) the Luttinger-Ward functional to 
construct type-III approximations. Consider the self-energy functional Eq. 
(|1.33p . Its first part consists of a simple explicit functional of S while its 
second part. i.e. Fu[^], is unknown but depends on U only. 



1.3.1 Reference system 

Due to this universality of Fu [S] , one has 

a,^t/[^]-Trln / +Fu[S] (1.37) 

for the self-energy functional of a so-called "reference system" . As compared 
to the original system of interest, the reference system is given by a Hamil- 
tonian H' = -ffo(t') + Hi{U) with the same interaction part U but modified 
one-particle parameters t' . The reference system has different microscopic 
parameters but is assumed to be in the same macroscopic state, i.e. at the 
same temperature T and the same chemical potential /z. By a proper choice 
of its one-particle part, the problem posed by the reference system H' can 
be much simpler than the original problem posed by H. We assume that the 
self-energy of the reference system St'.u can be computed exactly, e.g. by 
some numerical technique. 

Combining Eqs. (|1.33p and (|1.37p . one can eliminate the unknown func- 
tional Fu[S]: 
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Qt^u[S\ = +Trln— ^— -Trln— ^ . (1.38) 

It appears that this amounts to a shift of the problem only as the self-energy 
functional of the reference system again contains the full complexity of the 
problem. In fact, except for the trivial case U — Q, the functional dependence 
of Qt' .u[^] is unknown - even if the reference system is assumed to be 
solvable, i.e. if the self-energy St',U: the Green's function Gt'^u £ind the 
grand potential f2t' ,u of th^ reference system are available. 

However, inserting the self-energy of the reference system St',u hito the 
self-energy functional of the original one, and using f2t' .u[^t' ,u] = ^t'.u and 
the Dyson equation of the reference system, we find: 

nt,u[^t',u] = ^t'.u + Tr In ^ Tr In Gf.c/ ■ (1.39) 

This is a remarkable result. It shows that an exact evaluation of the self-energy 
functional of a non-trivial original system is possible, at least for certain self- 
energies. This requires to solve a reference system with the same interaction 
part. 



1.3.2 Domain of the self-energy functional 

Eq. (|1.39p provides an explict expression of the self-energy functional Slt^u [^] ■ 
This is suitable to discuss the domain of the functional precisely. Take U to 
be fixed. We define the space of t-representable self-energies as 

Su = {S\3t: S = St,u}- (1.40) 

This definition of the domain is very convenient since it ensures the correct 
analytical and causal properties of the variable S. 

We can now formulate the result of the preceeding section in the following 
way. Consider a set of reference systems with U fixed but different one- 
particle parameters t', i.e. a space of one-particle parameters T' ■ Assume 
that the reference system with H' = Hf.u can be solved exactly for any 
t' G T' ■ Then, the self-energy functional ]7t,c/[-^] can be evaluated exactly 
on the subspace 

Si = {E I 3t' e r' : Et'^u} C Su ■ (1.41) 
This fact can be used to construct type-Ill approximations, see Fig. 11.21 
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/L = Z(t) 




Fig. 1.2 Schematic illustration for the construction of consistent approximations within 
the SFT. The grand potential is considered as a functional of the self-energy which is 
parametrized by the one-particle parameters t of the Hamiltonian ([/ is fixed). At the 
physical self-energy, fi is stationary (filled red circles). The functional dependence of Q 
on S is not accessible in the entire space of self-energies {E space) but on a restricted 
subspace of "trial" self-energies parametrized by a subset of one-particle parameters t' 
(solid red lines) corresponding to a "reference system", i.e. a manifold of systems with 
the same interaction part but a one-particle part given by t' which renders the solution 
possible. The grand potential can be evaluated exactly on the submanifold of reference 
systems. A stationary point on this submanifold represents the approximate self-energy 
and the corresponding approximate grand potential (open circle). 

1.3.3 Construction of cluster approximations 

A certain approximation is defined by a choice of the reference system or 
actually by a manifold of reference systems specified by a manifold of one- 
particle parameters 7"'. As an example consider Fig. 11.31 The original system 
is given by the one-dimensional Hubbard model with nearest-neighbor hop- 
ping t and Hubbard interaction U . A possible reference system is given by 
switching off the hopping between clusters consisting of Lc = 2 sites each. 
The hopping within the cluster t' is arbitrary, this defines the space T' ■ The 
self-energies in 5', the corresponding Green's functions and grand potentials 
of the reference system can obviously be calculated easily since the degrees 
of freedom are decoupled spatially. Inserting these quantities in Eq. (|1.39p 
yields the SFT grand potential as a function of t': 




(1.42) 



This is no longer a functional but an ordinary function of the variational 
parameters t' G 7"'. The final task then consists in finding a stationary point 
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t^pt of this function: 

^ = fort'^t^p,. (1.43) 

In the example considered this is a function of a single variable t' (we as- 
sume t' to be the same for all clusters) . Note that not only the reference sys- 
tem (in the example the isolated cluster) defines the final result but also the 
lattice structure and the one-particle parameters of the original system. These 
enter f2{t') via the free Green's function Gt,o of the original system. In the 
first term on the r.h.s of Eq. (jl.39p we just recognize the CPT Green's func- 
tion 1/(G^Q — St',u)- The approximation generated by a reference system 
of disconnected clusters is called variational cluster approximation (VGA). 

An example for the results of a numerical calculation is given in Fig. 11.41 
see also Ref. [57] . The calculation has been performed for the one-dimensional 
particle-hole symmetric Hubbard model at half-filling and zero temperature. 
The figure shows the numerical results for the optimal nearest-neighbor intra- 
cluster hopping t' as obtained from the VGA for a reference system with 
disconnected clusters consisting of Lc = 10 sites each. The hopping t' is as- 
sumed to be the same for all pairs of nearest neighbors. In principle, one could 
vary all one-particle parameters that do not lead to a coupling of the clusters 
to get the optimal result. In most cases, however, is it necessary to restrict 
oneself to a small number of physically motivated variational parameters to 
avoid complications arising from a search for a stationary point in a high- 
dimensional parameter space. For the example discussed here, the parameter 
space T' is one-dimensional only. This is the most simple choice but more 
elaborate approximations can be generated easily. The fiexibility to construct 
approximations of different quality and complexity must be seen as one of 
the advantages of the variational cluster approximation and of the SFT in 
general. 

As can be seen from the figure, a non-trivial result, namely t'^^^ ^ t = I, is 
found for the optimal value of t^p^. We also notice that topt > t. The physical 
interpretation is that switching off the mier-cluster hopping, which generates 
the approximate self-energy, can partially be compensated for by enhancing 
the mira-cluster hopping. The effect is the more pronounced the smaller is 



U t t t 

H 
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U 

Fig. 1.3 Variational cluster approximation (VGA) for the Hubbard model. Top: represen- 
tation of the original one-dimensional Hubbard model H with nearest-neighbor hopping t 
and Hubbard-interaction U. Bottom: reference system H' consisting of decoupled clusters 
of Lc = 2 sites each with intra-cluster hopping t' as a variational parameter. 
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Fig. 1.4 (taken from Ref. [27]). SFT ground-state energy per site, i.e. (n(t') + ^i{N)) / L, 
as a function of the intra-cluster nearest-neighbor hopping t' for Lc = 10 and different 
U [fi = U/2) at zero temperature. Arrows indicate the respective optimal t' . The energy 
scale is fixed by t = 1. 



the cluster size Lc- Furthermore, it is reasonable that in case of a stronger 
interaction and thus more localized electrons, switching off the inter-cluster 
hopping is less significant. This can be seen in Fig. If .41 The largest optimal 
hopping t'^pf. is obtained for the smallest U. 

On the other hand, even a "strong" approximation for the self-energy 
(measured as a strong deviation of t'^p^. from t) becomes irrelevant in the 
weak- coupling limit because the self-energy must vanish for U = 0. Generally, 
we note that the VGA becomes exact in the limit C/ = 0: In Eq. (|1.38p the 
first and the third terms on r.h.s cancel each other and we are left with 

nt,u=o[S] = Trln J . (1.44) 

Since the trial self-energy has to be taken from a reference system with the 
same interaction part, i.e. U = and thus S = 0, the limit becomes trivial. 
For weak but finite U, the SFT grand potential becomes flatter and flatter 
until for U = the t' dependence is completely irrelevant. 

The VGA is also exact in the atomic limit or, more general and again 
trivial, in the case that there is no restriction on the trial self-energies: S' = S. 
In this case, t'^^^ = t solves the problem, i.e. the second and the third term on 

the r.h.s of Eq. (|1.39p cancel each other and ilt^ui^f ,u] — ^f.u for t' = t. 

Gluster-perturbation theory (GPT) can be understood as being identical 
with the VGA provided that the SFT expression for the grand potential is 
used and that no parameter optimization at all is performed. As can be seen 
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from Fig. II. 4[ there is a gain in binding energy due to the optimization of t', 
i.e. /2(tQpj) < f2{t). This means that the VGA improves on the CPT resuh. 

Fig. 11.51 shows the ground-state energy (per site) , i.e. the SFT grand po- 
tential at zero temperature constantly shifted by fj,N at the stationary point, 
as a function of the inverse cluster size l/Lc- The dependence turns out to be 
quite regular and allows to recover the exact Bethe-Ansatz result (BA) [5S] 
by extrapolation to 1/Lc = 0. The VGA result represents a considerable im- 
provement as compared to the "direct" cluster approach where Eq is simply 
approximated by the ground-state energy of an isolated Hubbard chain (with 
open boundary conditions). Gonvergence to the exact result is clearly faster 
within the VGA. Note that the direct cluster approach, opposed to the VGA, 
is not exact for U = 0. 

In the example discussed so far a single variational parameter was taken 
into account only. More parameters can be useful for different reasons. For 
example, the optimal self-energy provided by the VGA as a real-space clus- 
ter technique artificially breaks the translational symmetry of the original 
lattice problem. Finite-size effects are expected to be the most pronounced 
at the cluster boundary. This suggests to consider all intra-cluster hopping 
parameters as independent variational parameters or at least the hopping at 
the edges of the chain. 

The result is shown in Fig. 11.61 We find that the optimal hopping varies 
between different nearest neighbors within a range of less than 10%. At the 
chain edges the optimal hopping is enhanced to compensate the loss of itin- 
erancy due to the switched-off inter-cluster hopping within the VGA. With 
increasing distance to the edges, the hopping quickly decreases. Quite gener- 
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Fig. 1.6 (taken from Ref. 1271 V Optimized tiopping parameters for the reference systems 
shown in Fig. I1.3l but for larger clusters with Lc sites each as indicated. VGA results for 
J7 = 4, t = 1 at half-filling and temperature T = 0. Black: hopping assumed to be uniform. 
Red: two hopping parameters varied independently, the hopping at the two cluster edges 
and the "bulk" hopping. Blue: hopping at the edges, next to the edges and bulk hopping 
varied. Green: four hopping parameters varied. Orange: five hopping parameters varied. 



ally, the third hopping parameter is already elose to the physical hopping t. 
Looking at the Lc = 10 results where all (five) different hopping parameters 
have been varied independently (orange circles), one can see the hopping to 
slightly oscillate around the bulk value reminiscent of surface Friedel oscilla- 
tions. 

The optimal SFT grand potential is found to be lower for the inhomoge- 
neous cases as compared to the homogeneous (black) one. Generally, the more 
variational parameters are taken into account the higher is the decrease of 
the SFT grand potential at optimal parameters. However, the binding-energy 
gain due to inhomogeneous hopping parameters is much smaller compared to 
the gain obtained with a larger cluster. 

Considering an additional hopping parameter tpbc linking the two chain 
edges as a variational parameter, i.e. clusters with periodic boundary condi- 
tions always gives a minimal SFT grand potential at ipbc = (instead of a 
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stationary point at <pbc = !)• This implies that open boundary conditions 
are preferred (see also Ref. [13)). 



1.4 Consistency of approximations 

1.4-1 Analytical structure of the Green's function 

Constructing approximations within the framework of a dynamical varia- 
tional principle means that, besides an approximate thermodynamical poten- 
tial, approximate expressions for the self-energy and the one-particle Greens 
function are obtained. This raises the question whether their correct analyti- 
cal structure is respected in an approximation. For approximations obtained 
from self-energy-functional theory this is easily shown to be the case in fact. 

The physical self-energy Sap{uj) and the physical Green's function Gap{uj) 
are analytical functions in the entire complex w plane except for the real axis 
and have a spectral representation (see Eq. (|1.4p ) with non- negative diagonal 
elements of the spectral function. 

This trivially holds for the SFT self-energy St',u. api^^) since by construc- 
tion Sf ,u ,ai3{^) is the exact self-energy of a reference system. The SFT 
Green's function is obtained from the SFT self-energy and the free Green's 
function of the original model via Dyson's equation: 

G=-—r^—^. (1.45) 

It is easy to see that it is analytical in the complex plane except for the real 
axis. To verify that is has a spectral representation with non-negative spectral 
function, we can equivalently consider the corresponding retarded quantity 
Gi.ot('^) = G(a;-|-iO''') for real frequencies uj and verify that Grot = Gr — iGj 
with Gr, Gi Hermitian and Gj non-negative: 

We can assume that Go, ret — Go,r — iGoj with Go,r, Gq.i Hermitian and 
Go, I non-negative. Since for Hermitian matrices A, B with B non-negative, 
one has 1/(A ± iB) = X =f iY with X, Y Hermitian and Y non-negative 
(see Ref. [12]), we find G(^ret = Pk + ^Pi with Pr, Pi Hermitian and Pi non- 
negative. Furthermore, we have ^ret = — i^i with X'r, Si Hermitian 
and Si non-negative. Therefore, 



1 1 



iP\ - Sn + iSi Qr, + iQi 



(1.46) 



with Qr Hermitian and Qi Hermitian and non-negative. 

Note that the proof given here is simpler than corresponding proofs for 
cluster extensions of the DMFT [291130] because the SFT does not involve a 
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"self-consistency condition" which is the main object of concern for possible 
causality violations. 



1.4-2 Thermodynamical consistency 

An advantegeous feature of the VGA and of other approximations within 
the SFT framework is their internal thermodynamical consistency. This is 
due to the fact that all quantities of interest are derived from an approxi- 
mate but explicit expression for a thermodynamical potential. In principle 
the expectation value of any observable should be calculated by via 

W ^ 1^ , (1.47) 

where n = n{t') is the SFT grand potential (see Eq. pr42)) ) at t' = t^pt and 
is a parameter in the Hamiltonian of the original system which couples 
linearly to A, i.e. Ht u = + XaA. This ensures, for example, that the 
Maxwell relations 

are respected. 

Furthermore, thermodynamical consistency means that expectation values 
of arbitrary one-particle operators A — J2ap ^afscl^Cfj can consistently either 
be calculated by a corresponding partial derivative of the grand potential on 
the one hand, or by integration of the one-particle spectral function on the 
other. As an example we consider the total particle number = X^qcJ^Cq,. 
A priori it not guarateed that in an approximate theory the expressions 

(N) = , (1.49) 



and 



/oo 
f{z)A^^{z)dz (1.50) 
-CJO 



with /(z) = l/(exp(z/r) + 1) and the spectral function ^q,^(z) will give the 
same result. 

To prove thermodynamic consistency, we start from Eq. (|1.49p . According 
to Eq. (|1.39|) . there is a twofold n dependence of i7 = nt,u[St'^^^.u]- (i) 
the explicit /x dependence due to the chemical potential in the free Green's 
function of the original model, G^q = cj + /i — t, and (ii) an implicit n 
dependence due to the ^ dependence of the self-energy ^ jj, the Green's 
function Gt' ^,J7 and the grand potential flf ^ jj of the reference system: 
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(„) = _J^-/£L. (1,1) 

Note that the implicit ^ dependence is due to the chemical potential of the 
reference system which, by construction, is in the same macroscopic state as 
the original system as well as due to the /i dependence of the stationary point 
t^pt itself. The latter can be ignored since 

for t' = t^p^. because of stationarity condition Eq. (|1.43p . 

Wc assume that an overall shift of the one-particle energies e' = t'^^ is 
included in the set T' of variational parameters. Apart from the sign this 
is essentially the "chemical potential" in the reference system but should be 
formally distinguished from /i since the latter has a macroscopic thermody- 
namical meaning and is the same as the chemical potential of the original 
system which should not be seen as a variational parameter. 

The self-energy, the Green's function and the grand potential of the refer- 
ence system are defined as grand-canonical averages. Hence, their ^ depen- 
dence due to the grand-canonical Hamiltonian %' = H' — jiN is (apart from 
the sign) the same as their dependence on e': Consequently, we have: 

dQ _ dQ _^ 
dfiim. de' 

due to the stationarity condition again. 

We are then left with the explicit ^ dependence only: 

dQ d 1 1 , , 

= Trln = -Tr— J . (1.54) 

c'/^ox. c'Mcx. G^Q — St'^^^,u Q — St'^^^.u 

Converting the sum over the Matsubara frequencies implicit in the trace Tr 
into a contour integral in the complex lo plane and using Cauchy's theorem, 
we can proceed to an integration over real frequencies. Inserting into Eq. 
(|1.5ip . this yields: 



dio (1.55) 



for t' ~ i^pj which is just the average particle number given by (jl.50p . This 
completes the proof. 
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1.4-3 Symmetry breaking 



The above discussion has shown that besides intra-cluster hopping parame- 
ters it can also be important to treat one-particle energies in the reference 
cluster as variational parameters. In particular, one may consider variational 
parameters which lead to a lower symmetry of the Hamiltonian. 

As an example consider the Hubbard model on a bipartite lattice as the 
system of interest and disconnected clusters of size Lc as a reference sys- 
tem. The reference-system Hamiltonian shall include, e.g., an additional a 
staggered magnetic-field term: 

HLt. = B'Yl ^»("'t - "4) ' (1-56) 

where Zi ^ +1 for sites on sublattice 1, and = — 1 for sublatticc 2. The 
additional term H^^^. leads to a valid reference system as there is no change of 
the interaction part. We include the field strength B' in the set of variational 
parameters, B' & T'- 

B' is the strength of a ficticious field or, in the language of mean-field 
theory, the strength of the internal magnetic field or the Weiss field. This 
has to be distinguished clearly from an external physical field applied to the 
system with field strength B: 

Hphys. = B^^Zi{n^^ - nil) (1-57) 

This term adds to the Hamiltonian of the original system. 

We expect B'^^^. = in case of the paramagnetic state and B = (and 
this is easily verified numerically). Consider the B' and B dependence of 
the SFT grand potential Q{B\B) ~ flsl^B']- Here we have suppressed 
the dependencies on other variational parameters t' and on t, U . Due to the 
stationarity condition, dQ{B\B)/dB' = 0, the optimal Weiss field B' can 
be considered as a function of _B, i.e. B^pj = B'{B). Therefore, we also have: 

ddnmBlB)_^^ (1.58) 



dB dB' 



This yields: 



d^[2iB'{B),B)dB'{B) d^Q{B'{BlB) _ 

dB~^ dBdB' ^^■^^> 



Solving for dB' /dB we find: 



22 



Michael Potthoff 



This clearly shows that B' = B'^^^ has to be interpreted carefully. B' can be 
much stronger than B if the curvature d^f2/dB'^ of the SFT functional at 
the stationary point is small, i.e. if the functional is rather flat as it is the 
case in the limit [/ — > 0, for example. 

Form the SFT appoximation for the staggered magnetization, 

m = j:-din.t - .4)) « ^m'iB),B) = ^^^^'[31^) ^^^^^ 

where the stationarity condition has been used once more, we can calculate 
the susceptibility, 

_ dm _ d^n{B'{B),B) dB'iB) d^n{B'{B),B) 
^~'dB~ dB'dB dB~^ dB^ ' ' 



Using Eq. p:Ml) . 



dB^ \dB'^J KdB'dB 



(1.63) 



We see that there are two contributions. The first term is due to the explicit 
B dependence in the SFT grand potential while the second is due to the im- 
plicit B dependence via the B dependence of the stationary point. Eq. (|1.63p 
also demonstrates (see Ref. [31]) that for the calculation of the paramagnetic 
susceptibility x one may first consider spin-independent variational parame- 
ters only to find a stationary point. This strongly reduces the computational 
effort [32] ■ Once a stationary point is found, partial derivatives according to 
Eq. (|1.63p have to calculated with spin-dependent parameters A in a single 
final step. 

Spontaneous symmetry breaking is obtained at _B = if there is a sta- 
tionary point with B'^^^. ^ 0. Fig. 11.71 gives an example for the particle- hole 
symmetric Hubbard model on the square lattice at half-filling and zero tem- 
perature. As a reference system a cluster with Lc = 10 sites is considered, 
and the ficticious staggered magnetic field is taken as the only variational 
parameter. There is a stationary point at B' = which corresponds to the 
paramagnetic phase. At B' = the usual cluster-perturbation theory is re- 
covered. The two equivalent stationary points at finite B' correspond to a 
phase with spontaneous antiferromagnetic order - as expected for the Hub- 
bard model in this parameter regime. The antiferromagnetic ground state 
is stable as compared to the paramagnetic phase. Its order parameter m is 
the conjugate variable to the ficticious field. Since the latter is a variational 
parameter, m can either be calculated by integration of the spin-dependent 
spectral density or as the derivative of the SFT grand potential with respect 
to the physical field strength B with the same result. More details are given 
in Ref. [33l. 
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Fig. 1.7 (taken from Ref. SFT grand potential as a function of the stength of a 

ficticious staggered magnetic field B' . VGA calculation using disconnected clusters con- 
sisting of Lc = 10 sites each for the two-dimensional Hubbard model on the square lattice 
at half-filling, zero temperature, U = 8 and nearest-neighbor hoppint i = 1. 



The possibility to study spontaneous symmetry breaking using the VGA 
with suitably chosen Weiss fields as variational parameters has been exploited 
frequently in the past. Besides antiferromagnetism [55^55] . spiral phases [57] . 
ferromagnetism [35], d-wave superconductivity P5t444| . charge order 
and orbtial order |37] have been investigated. The fact that an explicit ex- 
pression for a thermodyanmical potential is available allows to study discon- 
tinuous transitions and phase separation as well. 



1.4-4 Non-perturbative conserving approximations 

Continuous symmetries of a Hamiltonian imply the existence of conserved 
quantities: The conservation of total energy, momentum, angular momen- 
tum, spin and particle number is enforced by a not explicitly time-dependent 
Hamiltonian which is spatially homogeneous and isotropic and invariant un- 
der global SU(2) and U(l) gauge transformations. Approximations may arti- 
ficially break symmetries and thus lead to unphysical violations of conserva- 
tions laws. Baym and Kadanoff [JSIIH] have analyzed under which circum- 
stances an approximation respect the mentioned macroscopic conservation 
laws. Within diagrammatic perturbation theory it could be shown that ap- 
proximations that derive from an explicit but approximate expression for the 
LW functional 'P ("(^-derivable approximations") are "conserving". Exam- 
ples for conserving approximations are the Hartree-Fock or the fluctuation- 
exchange approximation [481150]. 
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The SFT provides a framework to construct (^-derivable approximations 
for correlated lattice models which are non-perturbative, i.e. do not employ 
truncations of the skeleton-diagram expansion. Like in weak-coupling con- 
serving approximations, approximations within the SFT are derived from 
the LW functional, or its Legendre transform Fu[S]. These are (^-derivable 
since any type-Ill approximation can also be seen as a type-II one, see Section 

For fermionic lattice models, conservation of energy, particle number and 
spin have to be considered. Besides the static thermodynamics, the SFT 
concentrates on the one-particle excitations. For the approximate one-particle 
Green's function, however, it is actually simple to prove directly that the 
above conservation laws are respected. A short discussion is given in Ref. [51] . 

At zero temperature T — there is another non-trivial theorem which 
is satisfied by any <?-derivable approximation, namely Luttinger's sum rule 
This states that at zero temperature the volume in reciprocal space 
that is enclosed by the Fermi surface is equal to the average particle number. 
The original proof of the sum rule by Luttinger and Ward [23] is based on the 
skeleton-diagram expansion of ^ in the exact theory and is straightforwardly 
transferred to the case of a <?-derivable approximation. This also implies that 
other Fermi-liquid properties, such as the linear trend of the specific heat 
at low T and Fermi-liquid expressions for the T = charge and the spin 
susceptibility are respected by a (^-derivable approximation. 

For approximations constructed within the SFT, a different proof has to 
be found. One can start with Eq. ()1.39p and perform the zero-temperature 
limit for an original system (and thus for a reference system) of finite size L. 
The different terms in the SFT grand potential then consist of finite sums. 
The calculation proceeds by taking the y^t-derivative, for T = 0, on both sides 
of Eq. (|1.39|) . This yields the following result (see Ref. [51] for details): 

{N) = {NY + 2j20iG^m - 2^6)(G',(0)) . (1.64) 

fc k 

Here (N) {{N)') is the ground-state expectation value of the total particle 
number N in the original (reference) system, and Gfc(O) (GJ,(0)) are the diag- 
onal elements of the one-electron Green's function G at a; = 0. As Luttinger's 
sum rule reads 

(iV)=2^e(G',(0)), (1.65) 

k 

this implies that, within an approximation constructed within the SFT, the 
sum rule is satisfied if and only if it is satisfied for the reference system, i.e. 
if (N)' = 2 6'(G^(0)). This demonstrates that the theorem is "propa- 
gated" to the original system irrespective of the approximation that is con- 
structed within the SFT. This propagation also works in the opposite di- 
rection. Namely, a possible violation of the exact sum rule for the reference 
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system would imply a violation of the sum rule, expressed in terms of ap- 
proximate quantities, for the original system. 

There are no problems to take the thermodynamie limit L — > oo (if de- 
sired) on both sides of Eq. p.64p . The k sums turn into integrals over the 
unit cell of the reeiprocal lattice. For a ZJ-dimensional lattice the D ~ 1- 
dimensional manifold of k points with G'fc(O) = c» or Gk(0) = form Fermi 
or Luttinger surfaces, respectively. Translational symmetry of the original as 
well as the reference system may be assumed but is not necessary. In the ab- 
sence of translational symmetry, however, one has to re-interprete the wave 
vector k as an index which refers to the elements of the diagonalized Green's 
function matrix G. The exact sum rule generalizes accordingly but can no 
longer be expressed in terms of a Fermi surface since there is no recipro- 
cal space. It is also valid for the case of a translationally symmetric original 
Hamiltonian where, due to the choice of a reference system with reduced 
translational symmetries, such as employed in the VGA, the symmetries of 
the (approximate) Green's function of the original system are (artificially) 
reduced. Since with Eq. ()1.64p the proof of the sum rule is actually shifted 
to the proof of the sum rule for the reference system only, we are faced with 
the interesting problem of the validity of the sum rule for a finite cluster. 
For small Hubbard clusters with non-degenerate ground state this has been 
checked numerically with the surprising result that violations of the sum rule 
appear in certain parameter regimes close to half- filling, see Ref. |51j . This 
leaves us with the question where the proof of the sum rule fails if applied 
to a system of finite size. This is an open problem that has been stated and 
discussed in Refs. [5T1[S3] and that is probably related to the breakdown of 
the sum rule for Mott insulators [54] . 



1.5 Bath degrees of freedom 

1.5.1 Motivation and dynamical impurity 
approximation 

Within self-energy-functional theory, an approximation is specified by the 
choice of the reference system. The reference system must share the same in- 
teraction part with the original model and should be amenable to a (numer- 
ically) exact solution. These requirements restrict the number of conceivable 
approximations. So far we have considered a decoupling of degrees of freedom 
by partitioning a Hubbard-type lattice model into finite Hubbard clusters of 
Lc sites each, which results in the variational cluster approximation (VGA). 

Another element in constructing approximation is to add degrees of free- 
dom. Since the interaction part has to be kept unchanged, the only possibility 
to do that consists in adding new uncorrelated sites (or "orbitals"), i.e. sites 
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Fig. 1.8 Different possible reference systems with the same interaction as the single-band 
Hubbard model on a square lattice. Filled circles: correlated sites with \J as in the Hub- 
bard model. Open circles: uncorrelated "bath" sites with t/ = 0. Lines: nearest-neighbor 
hopping. Big circles: continuous bath consisting of Lb = oo bath sites. Reference systems 
^A' ^B' generate variational cluster approximations (VGA), H'^ yields dynamical 
impurity approximation (DIA), H'^ the DMFT, and H'^ an intermediate approximation 
(VGA with one additional bath site per cluster). 



where the Hubbard- C/ vanishes. These are called "bath sites". The coupling 
of bath sites to the correlated sites with finite U in the reference system via 
a one-particle term in the Hamiltonian is called "hybridization" . 

Fig. II .81 shows different possibilities. Reference system A yields a trial self- 
energy which is local Sija-{io) ~ SijIJ{Lu) and has the same pole structure as 
the self-energy of the atomic limit of the Hubbard model. This results in a 
variant of the Hubbard-I approximation [2J . Reference systems B and C gen- 
erate variational cluster approximations. In reference system D an additional 
bath site is added to the finite cluster. Reference system E generates a local 
self-energy again but, as compared to A, allows to treat more variational pa- 
rameters, namely the on-site energies of the correlated and of the bath site 
and the hybridization between them. We call the resulting approximation a 
"dynamical impurity approximation" (DIA) with Lf, = 1. 

The DIA is a mean-field approximation since the self-energy is local which 
indicates that non-local two-particle correlations, e.g. spin-spin correlations, 
do not feed back to the one-particle Green function. It is, however, quite 
different from static mean-field (Hartree-Fock) theory since even on the Lb = 
1 level it includes retardation effects that result from processes cx where 
the electron hops from the correlated to the bath site and back. This improves, 
as compared to the Hubbard-I approximation, the frequency dependence of 
the self-energy, i.e. the description of the temporal quantum fluctuations. 
The two-site {Lb = 1) DIA is the most simple approximation which is non- 
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perturbative, conserving, thermodynamically consistent and which respects 
the Luttinger sum rule (see Ref. [51]). Besides the "atomic" physics that 
leads to the formation of the Hubbard bands, it also includes in the most 
simple form the possibility to form a local singlet, i.e. to screen the local 
magnetic moment on the correlated site by coupling to the local moment at 
the bath site. The correct Kondo scale is missed, of course. Since the two-site 
DIA is computationally extremely cheap, is has been employed frequently 
in the past, in particular to study the physics of the Mott metal-insulator 
transition [TTllIllSIlliSHSS] • 



1.5.2 Relation to dynamical mean-field theory 

Starting from E and adding more and more bath sites to improve the descrip- 
tion of temporal fluctuations, one ends up with reference system F where a 
continuum of bath sites (Lf, = oo) is attached to each of the disconnected 
correlated sites. This generates the "optimal" DIA. 

To characterize this approximation, we consider the SFT grand poten- 
tial given by Eq. (|1.39p and analyze the stationarity condition Eq. (|1.43p : 
Calculating the derivative with respect to t' , we get the general SFT Euler 
equation: 

^^l^G,J(*c.„)-S,,^(ic.„) "V,. 

(1.66) 

For the physical self-energy I^t,u of the original system Ht^jj, the equa- 
tion was fulfilled since the bracket would be zero. Vice versa, since Gt\u = 
Gij[^t' ,u], the physical self-energy of Ht^u is determined by the condition 
that the bracket be zero. Hence, one can consider the SFT Euler equation 
to be obtained from the exact conditional equation for the "vector" S in 
the self-energy space Su through projection onto the hypersurface of t' rcp- 
resentable trial self-energies by taking the scalar product with vectors 
dSt',u,ai3{i^n)/dt' tangential to the hypersurface. 

Consider now the Hubbard model in particular and the trial self-energies 
generated by reference system F. Actually, F is a set of disconnected single- 
impurity Anderson models (SIAM). Assuming translational symmetry, these 
impurity models are identical replicas. The self-energy of the SIAM is non- 
zero on the correlated ( "impurity" ) site only. Hence, the trial self-energies are 
local and site-independent, i.e. Sija{iLOn) — SijS{iujn), and thus Eq. (jl.66p 
reads: 

( G'(ioj„] 



) dt' " ' ^ 



28 



Michael Potthoff 



where the notation has been somewhat simphfied. The Euler equation would 
be solved if one-particle parameters of the SIAM and therewith an impurity 
self-energy can be found that, when inserted into the Dyson equation of the 
Hubbard model, yields a Green's function, the local element of which is equal 
to the impurity Green's function G'. Namely, the bracket in Eq. (|1.67p . i.e. 
the local (diagonal) elements of the bracket in Eq. (|1.66p . vanishes. Since the 
"projector" dSua{iu}n)/dt' is local, this is sufficient. 

This way of solving the Euler equation, however, is just the prescription to 
obtain the self-energy within dynamical mean- field theory (DMFT) and 
setting the local elements of the bracket to zero is just the self-consistency 
equation of DMFT. We therefore see that reference system F generates the 
DMFT. It is remarkable that with the VGA and the DMFT quite different 
approximation can be obtained in one and the same theoretical framework. 

Note that for any finite L^, as e.g. with reference system E, it is impossible 
to satisfy the DMFT self-consistency equation exactly since the impurity 
Green's function G' has a finite number of poles while the lattice Green's 
function (l/G|7^(«Wn) — ^{iuJn))iirT has branch cuts. Nevertheless, with the 
"help" of the projector, it is easily possible to find a stationary point topt of 
the self-energy functional and to satisfy Eq. (|1.67|) . 

Gonceptually, this is rather different from the DMFT exact-diagonalization 
(DMFT-ED) approach l60| which also solves a SIAM with a finite Lt but 
which approximates the DMFT self-consistency condition. This means that 
within DMFT-ED an additional ad hoc prescription to necessary which, op- 
posed to the DIA, will violate thermodynamical consistency. However, an 
algorithmic implementation via a self-consistency cycle to solve the Euler 
equation is simpler within the DMFT-ED as compared to the DIA [56ll59] . It 
has therefore been suggested to guide improved approximations to the self- 
consistency condition within DMFT-ED by the DIA [61] . The convergence of 
results obtained by the DIA to those of full DMFT with increasing number 
of bath sites Lh is usually fast. With respect to the Mott transition in the 
single-band Hubbard model, Lf, = 3 is sufficient to get almost quantitative 
agreement with DMFT-QMG results, for example [56] . 

Real-space DMFT for inhomogeneous systems [62H64] with a local but 
site-dependent self-energy Eij^jiiuin) = SijEi{iuJn) is obtained from reference 
system F if the one-particle parameters of the SIAM's at different sites are 
allowed to be different. This enlarges the space of trial self- energies. In this 
case we get one local self-consistency equation for each site. The different 
impurity models can be solved independently from each other in each step 
of the self-consistency cycle while the coupling between the different sites is 
provided by the Dyson equation of the Hubbard model. 
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Fig. 1.9 Dynamical impurity and cluster approximations generated by different refer- 
ence systems within SFT. Lc is the number of correlated sites in the reference cluster, 
risj = 1 + Lj, is the number of local degrees of freedom where Lj, denotes the number of 
additional bath sites attached to each of the Lc correlated sites. The variational cluster 
approximation (VGA) is obtained for finite clusters with Lc > 1 but without bath sites 
Ha = 1. This generalizes the Hubbard- 1- type approximation obtained for Lc = I. The 
dynamical impurity approximation (DIA) is obtained for ris > 1 but for a single corre- 
lated site Lc = 1. A continuum of bath sites, = oo generates DMFT [Lc = 1) and 
cellular DMFT (Lc > 1). Lc = oo, irrespective of the number of local degrees of freedom, 
corresponds to the exact solution. 

1.5.3 Cluster mean-field approximations 

As a mean-field approach, the DIA does not include the feedback of non-local 
two-particle correlations on the one-particle spectrum and on the thermody- 
namics. The DIA self-energy is local and takes into account temporal correla- 
tions only. A straightforward idea to include short-range spatial correlations 
in addition, is to proceed to a reference system with ic > 1, i-e. system of 
disconnected finite clusters with Lc sites each. The resulting approximation 
can be termed a cluster mean-field theory since despite the inclusion of short- 
range correlations, the approximation is still mean-field-like on length scales 
exceeding the cluster size. 

For an infinite number of bath degrees of freedom Lf, — >■ oo attached to 
each of the Lc > I correlated sites the cellular DMFT [30ll65] is recovered, see 
Fig. 11.91 Considering a single-band Hubbard model again, this can be seen 
from the corresponding Eulcr equation: 



where the prime at the sum over the sites indicates that i and j must belong 
to the same cluster of the reference system. Namely, IJija{iuJn) = and also 




(1.68) 
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the "projector" dSji„{iijJn)/dt' — if i and j belong to different clusters. 
This stationarity condition can be fulfilled if 



Note that G'(ia;„) is a matrix which is labeled as G^^- j,; (ia;„) where i,j = 
l,...,Lc refer to the correlated sites in the cluster while k,l = l,...,LbLc 
to the bath sites. G[j{iuJn) are the elements of the cluster Green's function 
on the correlated sites. The condition Eq. (|1.69p is just the self-consistency 
condition of the C-DMFT. 

As is illustrated in Fig. 11.91 the exact solution can be obtained with in- 
creasing cluster size Lc ^ oo either from a sequence of reference systems with 
a continuous bath = 1 + Li, ^ oo. corresponding to C-DMFT, or from a 
sequence with rig = 1, corresponding to VGA, or with a finite, small num- 
ber of bath sites ("cellular DIA"). Systematic studies of the one-dimensional 
Hubbard model [T31I17] have shown that the energy gain which is obtained 
by attaching a bath site is lower than the gain obtained by increasing the 
cluster. This suggests that the convergence to the exact solution could be 
faster on the "VGA axis" in Fig. 11.91 For a definite answer, however, more 
systematic studies, also in higher dimensions, are needed. 

In any case bath sites help to get a smooth dependence of physical quan- 
tities when varying the electron density or the (physical) chemical potential. 
The reason is that bath sites also serve as "charge reservoirs", i.e. during a n 
scan the ground state of the reference cluster may stay in one and the same 
sector characterized by the conserved total particle number in the cluster 
while the particle number on the correlated sites and the approximate par- 
ticle number in the original lattice model evolve continuously |27[I5^ . This 
is achieved by a ^-dependent charge transfer between correlated and bath 
sites. In addition, (at least) a single bath site per correlated site in a finite 
reference cluster is also advantageous to include the interplay between lo- 
cal (Kondo-type) and non-local (antiferromagnetic) singlet formation. This 
has been recognized to be important in studies of the Mott transition [66j 
in the two-dimensional and of ferromagnetic order in one-dimensional sys- 
tems |32| . For studies of spontaneous U(l) symmetry breaking, e.g. d-wave 
superconductivity in the two-dimensional Hubbard model |38H41j , doping de- 
pendencies can be investigated without bath sites due to mixing of cluster 
states with different particle numbers. 



1.5.4 Translation symmetry 

For any cluster approximation formulated in real space there is an apparent 
problem: Due to the construction of the reference system as a set of decoupled 




(1.69) 
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clusters, the trial self-energies do not preserve the translational symmetries of 
the original lattice. Trivially, this also holds if periodic boundary conditions 
are imposed for the individual cluster. Transformation of the original problem 
to reciprocal space does not solve the problem either since this also means to 
transform a local Hubbard-type interaction into a non-local interaction part 
which basically couples all fc points. 

There are different ideas to overcome this problem. We introduce a "peri- 
odizing" universal functional 

T\Sh = \X^5.,^,,^'-J'S^'J■ (1-70) 
i'j' 

which maps any trial self-energy onto translationally invariant one. In recip- 
rocal space this corresponds to the substitution Sk.k' ~> T[S]k.k' = Sk,k'Sk- 
Using this, we replace the self-energy functional of Eq. (|1.37|) by 

^i'U^] = Trln ^ ^ + , (1.71) 

as suggested in Ref . [67] . This new functional is different from the original one. 
However, as the physical self-energy is supposed to be translational invariant, 
it is a stationary point of both, the original and the modified functional. This 
means that the modified functional can likewise be used as a starting point 
to construct approximations. It turns out (see Ref. |68] for an analogous 
discussion in case of disorder) that for a reference system with Lc > 1 and 
rig = oo, the corresponding Euler equation reduces to the self-consistency 
equation of the so-called periodized cellular DMFT (PC-DMFT) [69]. The 
same modified functional can also be used to construct a periodized VGA, 
for example. 

While the main idea to recover the PC-DMFT is to modify the form of the 
self-energy functional, the dynamical cluster approximation (DCA) [29II70II71) 
is obtained with the original functional but a modified hopping term in the 
Hamiltonian of the original system. We replace t — ?> t and consider the func- 
tional f2~jj[Z!]. To ensure that the resulting approximations systematically 

approach the exact solution for cluster size Lc — )■ oo, the replacement t — > t 
must be controlled by Lc, i.e. it must be exact up to irrelevant boundary 
terms in the infinite-cluster limit. This is the case for 

t = {VW)UhU{VW)'' , (1.72) 

where U, V, and W are unitary transformations of the one-particle basis. U 
is the Fourier transformation with respect to the original lattice consisting 
of L sites {L X L unitary matrix). W is the Fourier transformation on the 
cluster {Lc x Lc), and V the Fourier transformation with respect to the 
super lattice consisting oiL/Lc supersites given by the clusters {LjLcXLjL^. 
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The important point is that for any finite L^. the combined transformation 
VW = WV ^ (7, while this becomes irrelevant in the limit — > oo. The 
detailed calculation (see Ref. |S5] for the analogous disorder case) shows that 
the DCA is recovered for a reference system with with Lc> 1 and rig — oo, if 
periodic boundary conditions are imposed for the cluster. The same modified 
construction can also be used to a get simplified DCA-type approximation 
without bath sites, for example. This "simplified DCA" is related to the 
periodized VGA in the same way as the DCA is related to the PC-DMFT. 
The simplified DCA would represent a variational generalization of a non- 
self-consistent approximation ("periodic CPT") introduced recently [72] . 



1.6 Systematics of approximations 

Since the SFT unifies different dynamical approximations within a single for- 
mal framework, the question arises how to judge on the relative quality of two 
different approximations resulting from two different reference systems. This, 
however, is not straightforward for several reasons. First, it is important to 
note that a stationary point of the self-energy functional is not necessarily a 
minimum but rather a saddle point in general (see Ref. for an example). 
The self-energy functional is not convex. Actually, despite several recent ef- 
forts [26 1 173 1 174] , there is no functional relationship between a thermodynam- 
ical potential and time-dependent correlation functions. Green's functions, 
self-energies, etc. which is known to be convex. 

Furthermore, there is no a priori reason why, for a given reference system, 
the SFT grand potential at a stationary point should be lower than the SFT 
grand potential at another one that results from a simpler reference system, 
e.g. a smaller cluster. This implies that the SFT does not provide upper 
bounds to the physical grand potential. There is e.g. no proof (but also no 
counterexample) that the DMFT ground-state energy at zero temperature 
must be higher than the exact one. On the other hand, in practical calcula- 
tions the upper-bound property is usually found to be respected, as can be 
seen for the VGA in Fig. 11.51 for example. Nevertheless, the non-convexity 
must be seen as a disadvantage as compared to methods based on wave func- 
tions which via the Ritz variational principle are able to provide strict upper 
bounds. 

To discuss how to compare two approximations within SFT, we first have 
to distinguish between "trivial" and "non-trivial" stationary points for a given 
reference system. A stationary point is referred to as "trivial" if the one- 
particle parameters are such that the reference system decouples into smaller 
subsystems. If, at a stationary point, all degrees of freedom (sites) are still 
coupled to each other, the stationary point is called "non-trivial" . It is pos- 
sible to prove the following theorem [75]: Consider a reference system with 
a set of variational parameters t' = t" + V where V couples two separate 
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subsystems. For example, V could be the inter-cluster hopping between two 
subclusters with completely decouples the degrees of freedom for V = Q and 
aU t". Then, 

f2t,u[^t"+v] = nt,u[^t"+o] + OiV^) , (1.73) 

provided that the functional is stationary at St" .u when varying t" only 
(this restriction makes the theorem non-trivial). This means that going from 
a more simple reference system to a more complicated one with more degrees 
of freedom coupled, should generate a new non-trivial stationary point with 
V ^ while the "old" stationary point with V = being still a stationary 
point with respect to the "new" reference system. Coupling more and more 
degrees of freedom introduces more and more stationary points, and none of 
the "old" ones is "lost" . 

Consider a given reference system with a non-trivial stationary point and 
a number of trivial stationary points. An intuitive strategy to decide between 
two stationary points would be to always take the one with the lower grand 
potential nt,u[St',u]- A sequence of reference systems (e.g. Hj^, H^, Hq, ...) 
in which more and more degrees of freedom are coupled and which eventually 
recovers the original system H itself, shall be called a "systematic" sequence 
of reference systems. For such a systematic sequence, the suggested strategy 
trivially produces a series of stationary points with monotonously decreasing 
grand potential. Unfortunately, however, the strategy is useless because it 
cannot ensure that a systematic sequence of reference systems generates a 
systematic sequence of approximations as well, i.e. one cannot ensure that 
the respective lowest grand potential in a systematic sequence of reference 
systems converges to the exact grand potential. Namely, the stationary point 
with the lowest SFT grand potential could be a trivial stationary point (like 
one associated with a very simple reference system only as Hj^ or iJg in Fig. 
II. 8[ for example). Such an approximation must be considered as poor since 
the exact conditional equation for the self-energy is projected onto a very 
low-dimensional space only. 

Therefore, one has to construct a different strategy which necessarily ap- 
proaches the exact solution when following up a systematic sequence of refer- 
ence systems. Clearly, this can only be achieved if the following rule is obeyed: 
A non-trivial stationary point is always preferred as compared to a trivial one 
(RO). A non-trivial stationary point at a certain level of approximation, i.e. 
for a given reference system becomes a trivial stationary point on the next 
level, i.e. in the context of a "new" reference system that couples at least 
two different units of the "old" reference system. Hence, by construction, the 
rule RO implies that the exact result is approached for a systematic series of 
reference systems. 

Following the rule (RO), however, may lead to inconsistent thermodynamic 
interpretations in case of a trivial stationary point with a lower grand po- 
tential as a non-trivial one. To avoid this, RO has to be replaced by: Trivial 
stationary points must he disregarded completely unless there is no non-trivial 
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Fig. 1.10 Possible trends of the SFT grand potential as a function of a variational 
parameter V coupling two subsystems of a reference system. V = corresponds to the 
decoupled case and must always represent a "trivial" stationary point. Circles show the 
stationary points to be considered. The point V = has to be disregarded in all cases 
except for a). The arrow marks the respective optimum stationary point according to the 
rules discussed in the text. 



one (Rl). This automatically ensures that there is at least one stationary 
point for any reference system, i.e. at any approximation level. 

To maintain a thermodynamically consistent picture in case that there are 
more than a single non-trivial stationary points, we finally postulate: Among 
two non-trivial stationary points for the same reference system, the one with 
lower grand potential has to be preferred (R2). 

The rules are illustrated by Fig. 11.101 Note that the grand potential away 
from a stationary point does not have a direct physical interpretation. Hence, 
there is no reason to intcrpretc the solution corresponding to the maximum 
in Fig. ll.lOi c) as "locally unstable" . The results of Ref. [12] (see Figs. 2 and 4 
therein) nicely demonstrate that with the suggested strategy (Rl, R2) one can 
consistently describe continuous as well as discontinuous phase transitions. 

The rules Rl and R2 are unambiguously prescribed by the general de- 
mands for systematic improvement and for thermodynamic consistency. 
There is no acceptable alternative to this strategy. The strategy reduces to the 
standard strategy (always taking the solution with lowest grand potential) 
in case of the Ritz variational principle because here a non-trivial stationary 
point docs always have a lower grand potential as compared to a trivial one. 

There are also some consequences of the strategy which might be consid- 
ered as disadvantageous but must be tolerated: (i) For a sequence of station- 
ary points that are determined by Rl and R2 from a systematic sequence of 
reference systems, the convergence to the corresponding SFT grand poten- 
tials is not necessarily monotonous, (ii) Given two different approximations 
specified by two different reference systems, it is not possible to decide which 
one should be regarded as superior unless both reference systems belong 
to the same systematic sequence of reference systems. In Fig. II. 8i one has 
H'j^ < < H'q < where "<" stands for "is inferior compared to" . Fur- 
thermore, i/g < H!p and H'p^ < but there is no relation between and 
iJg, for example. 
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1.7 Summary 

The above discussion has shown that self-energy-functional theory provides a 
general framework which allows to construct different dynamical approxima- 
tions for lattice models of strongly correlated electrons. These approximations 
derive from a fundamental variational principle, formulated in terms of the 
grand potential expressed as a functional of the self-energy, by restricting the 
domain of the functional. This leads to non-perturbative and thermodynam- 
ically consistent approximations. The SFT unifies different known approxi- 
mations in a single theoretical frame and provides new dynamical impurity 
(DIA) and variational cluster approximations (VGA). 

The essential step in the numerical evaluation consists in the calculation 
of the Green's function or the self-energy of a reference system with the 
same interaction part as the original model but with spatially decoupled 
degrees of freedom. Details of the numerical procedure can be found in Refs. 
pT | fT2 l [27 p 32 | [76 j . for example. Typically, exact diagonalization or the (band) 
Lanczos approach [77l[72 but also quantum Monte-Garlo techniques may 
be used [7^ as a reference-system solver. Since bath sites can be integrated 
out within Green- function based QMG schemes, QMG as an impurity /cluster 
solver is the method of choice for finite-temperature DMFT or cluster DMFT 
approaches, i.e. for reference systems with a continuum of bath sites. At zero 
temperature, and using reference systems without bath sites or a few bath 
degrees of freedom only, the SFT provides computationally fast techniques 
which complement the (cluster) DMFT methods. 

Besides applications to Hubbard-type model systems, the VGA has re- 
cently been employed to study the correlated electronic structure of real ma- 
terials, such as NiO [80], GoO, MnO [H], LaGoOg [31], TiOGl [82j, GrOa |83], 
TiN [53], and NiMnSb [SS]. Furthermore, the theory has been extended to 
study Bose systems [571I5B] and the Jayncs-Gummings lattice |87I88| . electron- 
phonon systems [55], systems with non-local interactions [89], systems with 
quenched disorder |68| and more. 
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